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Abstract 

This preliminary report addresses the expressive power of unit resolution regarding input data 
encoded with partial truth assignments of propositional variables. A characterization of the func- 
. tions that are computable in this way, which we propose to call propagatable functions, is given. By 

establishing that propagatable functions can also be computed using monotone circuits, we show 
' that there exist polynomial time complexity propagable functions requiring an exponential amount 

^ ' of clauses to be computed using unit resolution. These results shed new light on studying CNF en- 

codings of NP-complete problems in order to solve them using propositional satisfiability algorithms. 
A paper is being drafted, which aims to present the concepts introduced in the present report and 
the underlying scientific issues in a more simple way. 

<C ■ 1 Introduction and motivations 

, ' Unit resolution is a key feature of state of the art SAT solvers |13) [7] [5] , where it speeds up the search 

for solutions and inconsistencies. 
^ ^ It is well known that different CNF representations of a given problem do not always allow unit reso- 

QQ ■ lution to deduce the same information. For example, the CNF encoding for pseudo Boolean constraints 

0^ . proposed in [3j allows unit resolution to restore generalized arc consistency. This is not the case with 



the encoding proposed in [16j, which does not allow unit resolution to deduce as much information as 
the former encoding does. As a manner of speaking, the expressive power of unit resolution is best 



. exploited using the encoding proposed in [3], with notable consequences on the resolution time. 

Two important related questions are "What information can be deduced by unit resolution?" and 
"Which clauses are required in order to allow this information to be deduced?" 

These questions are strongly connected to the characterization of the application field of SAT solvers: 
^ , "Which problems can be solved as efficiently using a SAT solver as using a specialized solver?" and "How 

^ I to encode these problems into CNF formulae for optimal resolution time?" 

In this paper, we are interested in the functions that can be calculated by means of unit resolution. 
Studying the expressive power of unit resolution requires characterizing these functions, which will be 
called propagatable functions, and specifying the size of the formulae required to compute them. 

Section [2] presents the three main research directions related to the expressive power of unit resolu- 
tion. Section [3] introduces the concept of propagators and propagatable functions as a formal framework 
where unit resolution is a computing model. This section also presents theoretical results that will be 
used in section [H where the expressive power of unit resolution is compared to the one of monotone 
Boolean circuits. Section [5] ends the paper with a synthesis of the results, which highlight their impli- 
cations regarding the efficiency of unit resolution as a filtering technique in SAT solvers. 

2 Related works 

There are at least three research directions related to the study of the expressive power of unit resolution. 



1 



The first one aims to identify the classes of formulae for which unit resolution is a complete refutation 
procedure in the sense that it produces the empty clause if and only if the input formula is not satisfiable. 
For example, this property holds for the formulae containing only Horn clauses [9j. This approach differs 
from that proposed in this paper since it considers the formulae as input data instead of computing 
systems. 

The second direction aims to characterize the complexity of determining whether a given formula 
can be refuted by unit resolution or not. This decision problem denoted UNIT is known to be P-complete, 
meaning that for any problem vr with polynomial time complexity, there exists a log space reduction 
from TT to UNIT |10) . Circuit value and monotone circuit value, which consist to determine the output 
value of a Boolean circuit (monotone Boolean circuit, respectively), given its input values, are both 
P-complete too [8j. Regarding the complexity theory, UNIT, circuit value and monotone circuit value 
have then the same expressive power. In the present paper, a different point of view is adopted. The 
CNF formula is not the input data of a program, but the program itself. The input data is a partial truth 
assignment encoded in a natural way, i.e., each input variable can be either assigned to true, assigned 
to false, or not assigned. Similarly, the Boolean circuits are not considered as inputs of a program, 
but as a programs by themselves. In this context, circuit value and monotone circuit value have not 
the same expressive power for at least two reasons : (1) monotone circuits can only compute monotone 
Boolean functions, while any Boolean function can be computed using a general Boolean circuit, and 
(2) there exist monotone Boolean functions which can be computed by polynomially sized Boolean 
circuits, but requiring an exponential number of gates to be computed using monotone circuits |15| . 
One of our results is that used as a computation model, unit resolution with natural input encoding 
has the same expressive power as monotone Boolean circuits, then less expressive power than general 
Boolean circuits. Obviously, the input encoding plays a central role is this result since by the use of 
another encoding where all input variables are assigned, unit resolution can easily simulate any Boolean 
circuit. Nevertheless, the natural encoding is the one used internally in the SAT solvers. 

The third line is related to the search for efficient CNF encodings of various problems in order to solve 
them thanks to a SAT solver. Because unit resolution is implemented efficiently in SAT solvers, many 
works aim to find encoding schemes which allow unit resolution to make as many deductions as possible. 
In [6], a CNF encoding for enumerative constraints is proposed, which allows unit propagation to make 
the same deductions on the resulting formula as restoring arc consistency on the initial constraints 
does. This work was innovative because with the previously known encodings, unit propagation had 
less inference power than restoring arc consistency, which is the basic filtering method used in constraint 
solvers. It has been followed by various similar works on other kinds of constraints such as Boolean 
cardinality constraints [2j and pseudo-Boolean constraints |4], while in [Ij, a general way to construct 
such an encoding for any constraint is proposed. Today, it has become customary, when a new encoding 
is proposed, to address the question of the behavior of unit resolution on the obtained SAT instances. 
The problem is that some of these encodings produce a prohibitive number of clauses. This is why some 
authors seek a trade-off between the size of the formulae and the inference power of unit resolution and 
other deduction rules implemented in SAT solvers, such as the failed literal rule jllj . For example, this 
approach is developed in [14] and |12| in the context of Boolean cardinality constraints. 

3 Propagatable functions 
3.1 Unit resolution 

This section recalls the terminology and the principles involved in unit resolution, and introduces the 
notations that will be used in the rest of the paper. 

A literal is either a propositional variable or a negated propositional variable. A CNF formula is 
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defined as a conjunction ci A . . . A Cfc of clauses, where each clause = li^i V ... V li,\ci\ is a disjunction 
of hterals. The size of a CNF formula is its number of literal occurrences. 

A truth assignment on a set of prepositional variables is a function mapping some of the variables 
in this set to truth values, i.e., true or false. These variables are said to be fixed to true or false. If 
a truth assignment does not fix all the variables, it is said to be partial; else it is said to be complete. 
In this paper, a truth assignment will be represented as a set of literals. Given a propositional formula 
(p and a truth assignment /, (f)\j denotes (f> 

Any CNF formula (p is said to be satisfied (falsified, respectively) by a truth assignment I if and 
only if I causes (p to evaluate to true (false, respectively) in the standard way. A CNF formula (j) is 
said to be satisfiable if and only if there exists a truth assignment that satisfy (p. Any complete truth 
assignment satisfying a CNF formula (p is called a model of (p. 

For convenience, a clause can be represented as a set of literals and a CNF formula can be represented 
as a set of clauses. 

Example 1. The CNF formula (a V 5) A (a V b) can be represented as {{a, b}, {a, b}}. 

Unit resolution is an inference technique which aims either to detect an inconsistency or to assign 
some variables, so as to simplify a CNF formula. As described in a standard way by Algorithm [T| 
until there is no empty clause and there is at least one unit clause (w) in the input formula, all the 
occurrences of the literal w and all the clauses containing the literal w are removed. 



input : (f) [CNF formula]; 

output: {(p,E) [(CNF formula, set of literals)] or ±; 

while {1^0 and 3{l} e do 

ip ^ (p \ {c : c e (pj e c} \ {c : c e 4>J e c} U {c \ {1} : c e 4>J e c}; 

E ^EU{1}; 
end 

if {} e then 

I return _L 
else 

] return {<p, E) 
end 

Algorithm 1: The standard algorithm for unit resolution 

In the following, we will consider another algorithm (Algorithm [3|) that will be called the alternative 
algorithm for unit resolution. This alternative algorithm return _L if and only if the standard algorithm 
return _L, else it returns the same truth assignment as the standard algorithm does. But contrary to 
the standard algorithm, it does not modify the input formula. 

The alternative algorithm repeat n + 1 propagation stages, where n is the number of variables in the 
input formula. Each of these propagation stage is performed by the procedure propagation (Algorithm 

ED. 

The standard and the alternative algorithms are strictly equivalent. The first one produces a literal 
w if there is a unit clause [w) in the simplified formula, that is if there is a clause [liM ■ ■ ■ y lq\/ w) 
in the input formula such that the literals li, . . . ,lq are previously produced. The second one produces 
the same literal in the same conditions, with the only difference that it does not modify the input 
formula. The standard algorithm return _L when an empty clause is produced. This occurs when there 
is some unit clause {w) and the opposite clause (?D) in the simplified formula. In the same situation, the 
alternative algorithm does not stop, but adds both the literals w and w in the set E. As the standard 
algorithm, it will return _L at the end of its execution. 
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input : (f),E [(cnf formula, set of literals)] ; 
output: F [set of literals]; 

foreach literal w in cj) such that w ^ E do 

foreach clause (Zi V • • • V V w) of (f) such that li, . . . ,lk & E do 
I F^FU{w} 

end 
end 

return F] 

Algorithm 2: The procedure propagation, which perforras a propagation stage of the alternative 
algorithm for unit resolution. 



input : (f) [cnf formula[; 
output: E [set of literals[ or _L; 

n <— the number of variables in c/>; 
repeat n + 1 times 

\ £^ ^ _B U propagation(0, i?); 
end 

if there exists v such that v,v d E then 

I return _L 
else 

I return E 
end 

Algorithm 3: The alternative algorithm for unit resolution 

Of course, the alternative algorithm could be optimized in such a way that it stops if the last 
propagation stage did not modify the set E, or if there are a literal w and its opposite id in E. One 
of these two events necessarily occurs during the first n + 1 propagation stages, because if E contains 
n + 1 literals, it necessarily contains two opposite literals. This optimization has not been done because 
this algorithm is not intended to be implemented. It is only a way to prove some theoretical results 
which will be presented later. 

In the following of the paper, given any CNF formula (p with n variables and any integer 1 < A; < n+1, 

• Uk{4>) will denote the set of literals produced at the k^^ propagation stage of algorithm [Sj 

• Ui,,k{4>) wiU denote U^^^lAi{(f))] 

• W(</>) will denote the result of unit propagation applied to </>, i.e., either ± or hli,,n+i{4')- 

Example 2. With (p = (a V6) A (6) A (aVcVd) as input, both algorithmsU\cmd\E return U{4>) = {b,a}. 
Regarding algorithmic ^i(0) = {b} andlA2{(l)) = {a}- 

3.2 Reified formulae 

This section introduces the notion of reified CNF formula, which will be subsequently used as a tool to 
prove several results. 

Informally speaking, the reified counterpart of any CNF formula (p, is a satisfiable CNF formula 
CJ = reif((/)) such that applying unit resolution on a simulates all the inferences produced by applying 
unit resolution on (p. 

Let var((/>) denote the set of the variables occurring in (p. 
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Definition 1 (reified formula). Let (p be a CNF formula with n variables. The reified counterpart of (p 
is the formula a = reif{(j)) obtained as follows: 

• There are 2n(n + 2) variables in a, namely 

var(o-) = U^e^^r(4>){vo,Vo, • • • , "^n+il 
Given any propositional variable v G var(cj), let 6{v) denotes and 6{v) denotes v~ . 

• a consists of the following clauses: 

(1) for any unary clause (w) of (p, {6{w)o) A {5{w)() V 5{w)i) 0; which will be called initialization 

clauses of rank and 1, 

(2) for any stage 2 < i < n + \, and any variable v of (j), {vj_i V vf) A {v^_i V v^), which will be 

called propagation clauses of rank i, 

(3) for any stage 2 < i < n + \, and for any non-unary clause q of (j), /\yj^qX{(l^hw) , where 

X{q,i,w) = 5{w)i\J \l 6{t)i^i, 

which will be called deduction clauses of rank i. 

For convenience, in the following of the paper, each time that a formula (p and it reified counterpart 
fj = reif((/>) will be considered, the propagation stages on (p will be numbered from 1, while the 
propagation stages on a will be numbered from 0. 

Example 3. Let (p = (a) A(aV6). The reified counterpart of (p can be decomposed as a = AfT(2) Acr(3), 
where: 

^(1) = (ao)^(ao Va^") 

CT(2) = (afv at) A V a^) A (6+ V 6+) A (6^ V b^) A (4 V a+) A V a^) A (6+ V 6+) A (6^ V b^) 

^(3) = K Vfe^) A(6r Va^) A(4 V6+) A(6^Va3) 

The stage 1 of unit resolution fixes a to true in (p. Accordingly, thanks to the initialization clauses, 
the stages and 1 of unit resolution fix and to true in a. 

At the stage 2, unit resolution fixes b to true in (p. Accordingly, at the stage 2 of unit resolution of 
(T, the variable 6^ is fixed to true thanks to the deduction clause (a^'^Vft^), and the variable is fixed 
to true thanks to the propagation clause (a^ V a^). 

At the stage 3, unit resolution fixes no new variable in (p. Thanks to the propagation clauses (a^Va|^) 
and V b't), the stage 3 of unit resolution on a fixes and to true. 

This example shows the roles of the three kind of clauses. The initialization clauses simulate the first 
stage of unit resolution, which consists to fix the variables occurring in unit clauses. The propagation 
clauses allow unit resolution to propagate the values that where previously assigned. For example, if 
the variable a is fixed to true in cp at stage 1, i.e., is fixed to true in a, then the clause (a^ V a^) 
ensures that is fixed to true in a at stage 2. The deduction clauses allow unit resolution to simulates 
in a the deductions that are made in (p. For example, if at stage 1, the variable a is fixed to true in 

^{5{w)i) would have the same effect in only one propagation stage, but the reified formula is intentionally tailored in 
such a way that the variables vf and v~ are fixed at the [i + l)**" propagation stage. 
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and if there is a clause (a V b) in (j), then the clause {af V 6^) of cr allows unit resolution to fix 6^ to 
true, which indicates that unit resolution fixes b to true in (p. 

Note that the proposed model of reified formula is not optimal in the sense that it usually involves 
redundant clauses and useless clauses. Our purpose is to provide a tool for proving theoretical results 
which will be presented in the following of the paper. In this context, the relevant property of the 
reified counterpart of a formula <j) is that its size is polynomially related to the size of (p. Namely, if 
there are n variables and k clauses of size at most p in 0, then there are 0{ii?k) clauses of size at most 
p in the reified counterpart a of (p, because for any of the 0{n) propagation stages, a contains 0{n) 
propag ation clauses and 0{kn) deduction clause^ , and because the number of literals in any clause of 
a cannot exceed the size of the longest clause of (p. 

Definition 2 (reified formula with injected variables). Let (p be any CNF formula with n variables and 
V C var((/)) be a set of propositional variables. The formula 

reif((/>, V) = reif((/>) A ( /\ ((U V vf) A V ))) 

vev 

is said to be the reified counterpart of (p with injected variables V. The clauses added to reif(0) will be 
called injection clauses. 

Lemma 1. Let (p be any CNF formula with n variables. Let a = reif(0) and i be any integer such that 

< i < n + 1. For any variable v G vai{(p), applying unit resolution on a can fix and/or v~ only at 
the propagation stage i, and only to true. 

Proof. This property can be proved by induction on i. The only variables which can be fixed at the 
first propagation stage, i.e. i=0, are in {v^ ,Vq ,v G \a.T{(p)}, because no other variables are in unary 
clauses, and these variables can only be fixed to true, because they occur positively. Now, given any 

1 < i < n + 1, let us suppose the the property hold until the [i — 1)*'^ propagation stage (which 
implies that no variable in {vf,v^,0 < i < i — l,v S var (</>)} has been fixed negatively). The only 
way for unit resolution to fix variables in {v^,v^,v £ vari^cp)} is through clauses involving variables 
in {vf_-^^,v~_^,v G var((/))}, which, by induction hypothesis, can only be fixed at the propagation stage 
i — 1. Because the variables in {vf_-^^,v~_^,v G var((/))} occur positively in these clauses, they can be 
only fixed to true. □ 

Theorem 1. Let (p be any CNF formula with n variables and a = reif{(p) be the reified counterpart of 
(p. The following properties hold: 

L a is satisfiable. 

2. For any variable v G var(0), and any integer 1 < k < n + 1, the two following properties hold: 

(a) v'l G Uk{a) if and only if v e Ui„k{<p); 

(b) G Uk{a') if and only if v £ Ui,,k{4')- 

Proof. The first property arises because each clause of a contains at least one positive literal. 

The second property can be proved by induction on k. For sake of brevity, let us reformulate it as 
follows: for any variable v G \ai{(p)^ any integer 1 <k < n+1, and any literal w G {v,v^, 5k{w) G Ukicr) 
if and only if w G Ui,,k{(p). 

Clearly, the property holds for k = 1 because there is a clause (w) in (p if and only if there is a 
clause {6q{w)) and a clause {6o{w) V 6i{w)) in a. Now let us suppose that the property holds until the 
propagation stage A; — 1, A; > 1. 

■^Because each of the n variables occurs at most in k clauses. 
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Let us suppose that 5k{w) G lAk{(y)- Then, according to the lemma [H one of the two following 
conditions hold: 

1. There is a propagation clause {5k-i{w) V 5k{w)) in a and 5k-i{w) £ Uk{(^)- By induction 
hypothesis, w G l^L.k-ii'P)} then w G hli,,k{4>)- 

2. There is a deduction clause {5k-i{li) V • • ■\/ 5k-i{lq)y 5k{w)) in a and . . . , 5k-i{lq) G 
Uk-i{cr). This means that there is a clause (/i V- • -VlqVw) in (/>, and, by induction hypothesis, 
Ti, . . . , Iq G Then w G Ui„k{4'). 

Let us suppose that w G lAi^^k{4')- Then, according to the principle of unit resolution, one of the 
two conditions hold: 

1. There is a clause [w) in (j). Then, as shown in the first part of this proof, 5i{w) G Ui{(t). 
Thanks to the propagation clauses {6i{w) V 52{'w))^ . . . , {6k-i{w) V 6k{w)), Sk{w) G Ukicr). 

2. There is a clause {h V ■ ■ ■ \/ Iq V w) in and an integer 1 < i < k — 1 such that h, . . . ,lq G 
hli,,i{(j)). By construction of cr, there is a deduction clause V • • • V 5i{lq) V 5iJ^i{w)) in 
cr, and by induction hypothesis, . . . ,5i{lq) G Ui{a). Then G UiJ^i{a). Either 
because i+1 = A; or thanks to the propagation clauses [5i{'w)\/ 5iJ^i{w)) , . . . , {6k-i{w)\/5k{w))^ 
5k{w) G Uk{(T). 

□ 

As an interesting corollary of Theorem [H the failed literal rule which is a speed up technique 
implemented in some modern SAT solver, can be simulated by unit propagation. Given a formula a 
and a literal Z, the failed literal rule aims to test if / must be fixed. Unit resolution is applied to o" A (/) 
{a A (/), respectively). If an inconsistency is detected then I {I, respectively) is fixed to true. The 
same result can be obtained by applying unit resolution on reif((/> A (/)) A Ai;evar((/))(^rtfi ^ I'n+i 
(reif((/> A (/)) A At,gvar(,^) (^n+i V v-^^ V 0, respectively). 

Theorem 2. Let (/) be any CNF formula, V C var((/)) be a set of propositional variables, and I G ly 
a truth assignment. U{{reif{(j),V))\j) = Z//(reif (c^l/)), i.e., unit resolution have the same effect on 
U{(jeif{(j),V))\j) as it does on Z//(reif(^|/)). 

Proof. For any literal v £ I {v £ I, respectively), the two first stages of unit resolution applied to 
(reif(^, y))|/ produces the literals respectively), which are the literals produced from the 

clauses [v^ V v^)^ (^o") ((^o" V v^), {vq), respectively) of reif((/)|/). The other stages of unit resolution 
behave similarly in the two formulae, because the same clauses are involved. □ 

3.3 Computing with unit resolution 

This section explains how unit resolution can be used to compute functions. 
3.3.1 Definitions and terminology 

Definition 3 (Propagator). Let cp be a CNF formula, var{(j)) be the set of propositional variables occur- 
ring in (j), V CI var((^) a set of propositional variables, and s (z V a propositional variable. The triplet 
P = {(p, V, s) is called a propagator. The size of P is the size of the formula (p. 
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A propagator V, s) can act as a computer in the following way: 

• the input data is a partial truth assignment I of the variables in V, i.e., some variables are 
assigned to true, some are assigned to false, and the other are not assigned, 

• the output can take four possible values according to the result of applying unit resolution: 

- fail if = ±, 

- true if U{(l)\i) 7^ -L and s G U{(f)\i), 

- false if U{(f)\i) 7^ _L and s G U{(f)\i), 

- na if U{(l)\i) 7^ ± and s ^ and s ^ U{4)\i). 

Formally, {4>, V, s) computes a function / with domain ly and codomain {fail, true, false, na}, 
where ly denotes the set of all the consistent partial assignments on V, i.e., {I C V LI {v,v e V},yi G 
IJ^I}. 

Conversely, given a set V of propositional variables and any function / with domain D C ly and 
codomain {fail, true, false, na}, the following issues can be addressed: 

1. Can / be computed by a propagator ? 

2. If yes, how many clauses are required to compute / using unit resolution ? 

These questions are important because in a SAT solver, unit resolution is used both for detecting 
inconsistencies (the fail answer) and for inferring new information (the true or false answers), with 
the effect of accelerating the resolution. It is then useful to use concise CNF encodings which allow unit 
resolution to achieve as many deductions as possible. 

Definition 4 (reified propagator) . Let P = {cp, V, s) be a propagator. The reified counterpart of P is 
reif(P) = (-0, y, s*™^, s^^^^^, s^^''), such that s*™^ is the variable s^_^_i, s^^^^^ is the variable s~_,_i, s^^'' is 
new fresh variable, and 

V = reif(</),y)A( /\ (^A^A./"^^^)) 

«Gvar((/>) 

By construction of the formula ip, given any I € Xy, applying unit resolution to ip\j never returns 
_L and simulates unit resolution on cpli in the following sense: 

• unit resolution on 0|/ returns _L if and only if unit resolution on ipli produces s^^'' (i.e. fixes to 
s^^'' to true); 

• unit resolution on 0|/ produces s if and only if unit resolution on ip\i produces s*™^; 

• unit resolution on (f)\j produces s if and only if unit resolution on tpli produces s^^^^. 

Definition 5 (filtering function). Let V be a set of propositional variables. Any function f with domain 
D C ly and codomain {fail, true, false, na} is called a filtering function. 

Definition 6 (matching function). Let V be a set of propositional variables. Any function f with 
domain D C ly and codomain {yes, no} is called a matching function. 

Any filtering function can be specified with three matching functions in the following way: 

Definition 7 (matching functions related to a filtering function). Let f be a filtering function with 
domain D C ly, V € {vi, . . . ,Vn}- The three matching functions related to f are defined as follows: 
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• Z^*^'^ with domain D and codomain {yes, no}, such that for any I E D, f^^^^{I) = yes if and only 
if f {I) = fail, 

• /f^^ with domain D' = {I e D, f{I) fail}, such that for any I G D', /*'""^(/) = yes if and 
only if f{I) = true, 

• f^^^^ with domain D' = {I e D, f{I) ^ fail}, such that for any I G D' , /faise^-j^ ^ ^^^^ 
only if f{I) = false. 

Definition 8 (monotone matching function). Given the order relation no <m yes, any matching 
function f with domain D is said to he monotone if for any I,J€D such that I Q J, f{I) <m f{J)- 

Now we will formally define the filtering and the matching functions that are computable using unit 
resolution. 

Definition 9 (propagatable filtering function). Any filtering function f is said to he propagatable if 
and only if there exists a propagator which computes f . 

Now, two ways will be considered to compute matching functions with unit resolution. The first 
one consists in using a variable as output under the assumption that _L is never returned. The second 
one consists in considering that the output value yes when _L is returned. 

Definition 10 (propagatable matching function). Any matching function f with domain D C Xy is 
said to he propagatable if there exists a propagator {(j), V, s) such that for any I € D, the two following 
conditions hold: 

1. U{<p\i)^^, 

2. f{l) = yes if and only if s &U{(f)\i). 

Definition 11 (z^-propagatable matching function, z/-propagator) . Any matching function f with do- 
main D C ly is said to be //-propagatable if there exists a CNF formula cf) such that for any I & D, 
f{I) = true if and only iflA{(f)\i) = _L. The couple {V,(j)) is said to he a v-propagator computing f. 

To end this necessary sequence of definitions, let us address the notion of space complexity of 
propagatable functions. 

Definition 12 (polynomially propagatable functions). Let F he a family of filtering functions or a fam- 
ily of matching functions. T is said to he polynomially propagatable ( or polynomially v -propagatahle, 
if applicable) if and only if any function f ^ T with domain D C .y^j. can be computed using a 

CNF formula of size polynomially related to n. 

3.3.2 Propagability versus //-propagability 

In this section, we will show that propagatable and z/-propagatable matching functions have the same 
expressive power and similar space complexities. 

Theorem 3. Let f he a matching function, f is propagatable if and only if f is v -propagatahle. 

Theorem 4. Let f be a propagatable matching function, f is polynomially propagatahle if and only if 
f is polynomially v -propagatahle. 

Proof. 



9 



1. propagatable =^ z^-propagatable. 

Let / be a propagatable matching function with domain D, and P = {(j),V,s) be a propagator 
which computes /. Clearly, for any partial truth assignment I ^ D, applying unit resolution to 
the formula A (s))\j returns _L if and only if /(/) = yes. 

2. i/-propagatable =^ propagatable. 

Let / be a z^-propagatable function with domain D C ly ^ind (j) a CNF formula including n 
variables, such that for any I £ D, applying unit resolution to (pli returns _L if and only if 
f{I) = yes. 

Our aim is to build a new formula ip such that for any I £ D, applying unit resolution to does 
not return ± but fixes a variable s to true if and only if applying unit resolution to returns 
±, in such a way that the propagator {ip, V, s) computes /. 

The formula can be obtained as follows: 



V' = reif(0,y) A( /\ (n++i A n;+i A s)) 

n£var{(?!>) 

The variable s will be fixed to true if and only if unit resolution on (p fixes both a variable uf 
and a variable u~ to true. According to the theorems [1] and [21 this occurs if and only if applying 
unit resolution to returns _L. Then, {ip,V,s) is a propagator which computes /. 

Because the two transformations have polynomial space complexity, both theorems[3]and[3]hold. □ 



3.3.3 Filtering functions versus matching functions 

In this section, we will show that without loss of generality, studying the space complexity of propagat- 
able filtering functions reduces to studying the space complexity of propagatable matching functions. 

Theorem 5. Any filtering function f is propagatable if and only if the three related matching functions 
Z*"^"*^; f^^^'^^, and /^^^^ are propagatable. 

Theorem 6. Any filtering function f is polynomially propagatable if and only if the three related 
matching functions Z*'^™, f^'^^^'^^ and /^^'' are polynomially propagatable. 

Proof. 

1. filtering =^ matching 

Let / be a propagatable filtering function and f^"^^^, ^false^ ^^^^ yfail ^j^g related matching functions. 
Because / is propagatable, there exists a propagator {(j), V, s) that computes /. Then f^^^'^, f^^^^^^ 
and Z^*^'^ can be computed with the following propagators, respectively: 

(a) {(j),V,s) (which computes f^'^"^); 

(b) ((/> A (s V t), V, t) (which computes /^^'''''); 

(c) {ip, V, s^^''), where {tp, V, s^^^*^, s^^^'^'^, s^^'') is the reified counterpart of {(p, V, s). 

Clearly, f^^^'^, f^^^^^^ and /^^'' are propagatable. Now, because the size of tp is polynomially related 
to the size of (p, if / is polynomially propagatable then Z*"^"^, J^^'*^^, and f^^^^ are polynomially 
propagatable too. 
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2. matching filtering 

Let / be a filtering function with domain D C Xy^ V E {fi, . . . , Vn\ and /*'^"'^, f^^^^'^. and /^^^'^ the 
related matching functions. Suppose that f^^^^, f^'^^^'^^ and /^^'^ are propagatablc (polynomially 
propagatable, respectively). Now let us consider the three following propagators (with formulae 
of size polynomially related to n, respectively): 

(a) {(f)i,V,si), which computes J*™*^; 

(b) (02, V, S2), which computes /^^i^^; 

(c) (03, V, S3), which computes f^^'^. 

Let {ipi, V, sfil), {ip2, V, s*2''"^ 5^''^'% s^*"'^), {^3, V, s^5™^ s|^") be the reified counter- 

parts of these propagators. Without loss of generality, let us suppose that, except for the input 
variables in V, the formulae ipi,tp2,ip3 have no common variable. 

The function / can be computed (in polynomial space, respectively) using the following propa- 
gator: 

P = (^1 A A A (sf^ V s) A (sf^ V s) A (sp), V, s) 
Clearly, P computes /, which is then propagatable (polynomially propagatable, respectively). 

□ 



3.3.4 Boolean representations 

Given V = {vi, . . . ,Vn} a set of propositional variables, D C ly a set of partial truth assignments of 
V, f any matching function with domain D, and / any partial assignment in D, let us define: 

• the Boolean representation of J as /b = {xi, . . . , yi, . . . , y„) G {0, 1}" such as for any 1 < i < n, 
Xj = 1 if and only if Vi G /, and = 1 if and only if G J, 

• the Boolean representation of D as Db = {-^B,-^ G D}, 

• the Boolean representation of / as /b : -Db '-^ {0, 1}, such that for any I ^ D, fMi^n) = 1 if and 
only if /(/) = yes. 

Example 4. The following table gives an example of a matching function f and its Boolean counterpart 
/b- 



I 


In 


f{I) 


fMih) 


{vT, V2} 


(0,0,1,1) 


no 





{W, V2} 


(0,1,1,0) 


yes 


1 


m 


(0,0,1,0) 


no 





{V1,V2} 


(1,0,0,1) 


yes 


1 


{V1,V2} 


(1,1,0,0) 


yes 


1 


M 


(1,0,0,0) 


yes 


1 




(0,0,0,1) 


no 





M 


(0,1,0,0) 


yes 


1 


{} 


(0,0,0,0) 


no 






{{yi \/ s) f\ {v2 V s), {^1,^2}, s) is a propagator for f. 
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Example 5. The following table gives a matching function g which is not propagatable, and its Boolean 
counterpart ■ 

I Im gjl) gnih) 

{v} (0,1) yes 1 

[v] (1,0) no 

{} (0,0) yes 1 

There is no propagator for g because for any formulae 4>i C (j)2, ifU{<p2) 7^ -L then any variable 
fixed by unit resolution on (pi will be fixed on ^2 as well. It follows that the third line of the table is not 
consistent with the second one. 

3.4 Synthesis 

In this section, we first introduced the notion of filtering function as a general model of functions 
that can be computed by unit resolution. We then showed that any filtering function reduces to three 
matching functions, which are functions with binary codomain ({yes, no} without loss of generality) 
that can either be computed by unit resolution in two different ways: (1) unit resolution detects an 
inconsistency when the output value is yes, (2) it fixes a predefined output variable to true when the 
output value is yes. The main result of this section is that without loss of generality, studying the 
expressive power of unit resolution can be reduced to studying the tractability and the complexity of 
computing matching functions with unit resolution. As a corollary, in the sequel of the paper, only 
propagatable matching functions will be considered. 

4 Expressive power of propagators 

Using a complete truth assignment as input values, unit resolution has the same expressive power as 
Boolean circuits, because elementary gates can be directly translated into clauses. In this section, we 
will show that if some input variables are not fixed, the expression power of unit resolution fall down 
to the expression power of monotone Boolean circuits, i.e., circuits with only or / and gates. 

4.1 Boolean circuits 

A Boolean circuit is a directed acyclic graph representing a Boolean formula. It is said to be monotone 
when it contains only and and or gates. 

In the following, a Boolean circuit will be represented by a triplet {L,G,w), where L is a set of 
input labels, w is the output label, and G is a set of gates. A or gate (and gate, respectively) is denoted 
or{E,t) {and{E,t), respectively), where E is the set of input labels and t is the output label of the 
gate. A not gate is denoted not(g, t), where q is the input label of the gate and t is its output label. 

Given a Boolean circuit C = {{ei, . . . ,en},G,w) and any x = {xi, . . . , Xn) G {0,1}", let C{x) 
denote the output value of C under the assumption that its input values Formally, 
C{x) can be defined as val{w) such that for any 1 < i < n, val(ej) = Xi, for any gate or(E,t) G G, 
val(t) = Vee-B ^^K^)' §^^® aiid{E,t) G G, val(i) = /\^^^val{e), and for any gate not(g,t) G G, 

val(i) = -^val{q). 

For convenience, an additional gate tie(g,t) will be used to connect two nodes q and t in such a 
way that val(t) = val(q). 

Given any Boolean function / with domain D C {0, 1}" and codomain {0, 1}, any Boolean circuit 
G with n inputs is said to compute f if and only if for any x G D, C{x) = f{x). 
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4.2 Circuits computing propagatable functions 

Because the Boolean counterpart of any matching function is a Boolean function, it can be computed 
by a Boolean circuit. In this section, we will show that any matching function is propagatable if 
and only if its Boolean counterpart can be computed using a monotone circuit. Furthermore, we will 
establish a relationship between space complexity of propagatable matching functions and monotone 
circuit complexity. 

Theorem 7. For any matching function f, if there exists a monotone circuit with n gates, each of 
them with at most k inputs, which computes /b, then there exists a propagator with 0{nk) clauses, 

which computes f. 

Proof. Let us consider any matching function / with domain D C Xy, V = {vi, . . . ,Vn}, and any 
monotone circuit Q = {L,G,Uk) computing /b. 
Without loss of generality, let us suppose that 

• the set of input labels of Q is L = {ei, . . . , e2n}, 

• the set of the output labels of the gates of Q is {ui, . . . ,Uk}. 

Let r be a function that maps the labels of Q to propositional literals such that 

r(ei) = Vi,l < i < n 

T{e-i) = vTTi, n + 1 <i <2n 

T{ui) = Vn+i, 1 <i < k - 1 

, riuk) = s 

For any gate g = and({Q;i, . . . , a^}, t) G G, let 7r(5) = (r(Q! i) V • • • V t(q;^) V r(t)). 

For any gate g = OT{{ai, . . . ,am},t) G G, let Tr{g) = AI^i (r(ai) V r(t)). 

Let 

0= /\A9)- 
geG 

Now let us show by induction on the number k of gates in Q that the propagator P = (0, F, s) 
computes /. 

The property holds for fc = because if the circuit Q has no gate, the output label is one of the 
input labels or ei+„ related to the input variable Vi G V. If the input label is e, then the propagator 
{{},V,Vi) computes /. If the input label is ei+„ then the propagator {{vi V s),V,s), where s is a new 
fresh variable, computes /. 

Now, let us suppose the the property holds for any circuit with less than k clauses, k > 0. Let 
Q = {L, G, u) be any fc-gates monotone Boolean circuit which computes the Boolean counterpart /b of 
/ with input variables {vi, . . . , Let g be the output gate of Q. Let cci, . . . , am be the input labels of 
g. For any 1 < f < m, let Qi = {L, G\{g}, Ui). By induction hypothesis, each Qi computes the Boolean 
counterpart f^i of the matching function /j computed by the propagator P^ = {(f) \ 7r{g), V,T{ai)). 

Let us consider two cases: 

1. The output gate oi Q is g = and({ai, . . . , am}, u)- 

Because of the nature of g, for any / G Zy, fvil) = 1 if and only if for any 1 < i < m, fi{lM) = 1- 
Because of the nature of irig), f{I) = yes if and only if for any 1 <i <m, T{ai) G U{{(j)\TT{g))\i). 
Then /(/) = yes if and only if f^ih) = '^- 

2. The output gate of Q is ^ = or({ai, . . . , am)-, u). 

Because of the nature of g, for any / G Xy, = 1 if and only there exists 1 <i <m, such as 

fii^M) = 1- Because of the nature of TT{g), f{I) = yes if and only if there exists 1 < i < m, such 
that T{ai) G U{{4> \ T^{g))\i). Then /(/) = yes if and only if /b(/b) = 1- 
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□ 

Example 6. The circuit of the figure{l\ can be translated into a CNF formula (j) in the following way: 

- the gate and({ei, 62}, ui) produces the clause {vi V 7)2 Vwa); 

- the gate or({iii, 64}, 7x2) produces the clauses [v^ V s) and [v2 V s). 

This circuit computes the Boolean counterpart /b of the function f computed by the propagator 
{4>,{vi,V2},s). 

ei{vi) . 

62(^2) . 

63 (W) . . 

ei{v2) , 

Figure 1: A monotone circuit computing the Boolean counterpart of a propagatable function. 

Theorem 8. For any matching function f, if there exists a propagator {(j), V, s) computing f , then there 
exists a monotone circuit with 0{n'^k) gates computing f^, where n is the number of variables and k 
the number of clauses in (p. 

Proof. Let {(j), V, s) be a propagator computing a matching function /. Clearly, the propagator P = 
{ip = reif(</), V), V, s^_^_^) computes / too. According to Lemma[T]and Theorem[Tl ijj can be decomposed 
as '(/'o A 7/^1 A • • • A ipn+i such that 

• ipQ contains the initialization clauses of rank of the reified counterpart of (j), 

• ipi contains the initialization clauses of rank 1 as well as the injection clauses, 

• for any 2 < i < i, ipi contains both the propagation clauses and the deduction clauses of rank i. 
The corresponding circuit Q will contain the following nodes: 

• two input nodes ot and oi related to each input variable t , with the convention that ot = 1 
if and only if t is assigned to true, and ot = 1 if and only if t is assigned to false; 

• one major node ov related to any variable v G var(^) that can be assigned to true by unit 
resolution, with the convention that ov = 1 \i and only if unit resolution fixes v to true; 

• some additional nodes, if applicable; 

Major nodes and additional nodes can be constant, i.e. permanently assigned either to or 1. The 
constant nodes are not explicitly represented in the circuit but are referenced in the sets Uq and f/i, 
respectively. 

The circuit Q consists of several layers Qi, . . . where each Qi simulates the stage i of unit 

resolution on tp\i for any / G Xy- 

At the first step of the construction, Uq is initialized with the nodes ov^ {ovq , respectively) for 
any variable v G var((/)) such that {vq) {{vq), respectively) does not occur in ipQ, and U\ is initialized 
with the nodes ov'^ (oWq^, respectively) for any variable Vq {v^ , respectively) such that the clause {vq) 
{{vq), respectively) occurs in tpQ. 
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Each of the next steps builds Qi in such a way that it simulates the effect of unit resolution applied 
to ipi- This is done as follows: 

For each variable v oi (f) and for each variable u G {vf ^'^i}^ let C be the set of clauses of ipi 
containing n, simplified by removing the clauses containing a literal iu such that ow G Uq and removing 
any literal w such that ow £ Ui from the other clauses. 

If the set C is empty, which means that unit resolution cannot fix u, then ou is added to Uq. If C 
contains a clause (u), which means that unit resolution will always fix u to true, then 0u is added to 
Ui. If C contains only one clause (tD V n), meaning that u is fixed to true if and only if w is previously 
fixed to true, the connection tie{ow,ou) is produced. If C contains only one clause with more than 
two literals like (Wi V • • • V Wk V u), meaning that u is fixed to true if and only if wi and ... and Wk 
are previously fixed to true, the gate and({oi(;i, • • • ,<>'Wk},ou) is produced. 

In the other cases, i.e., when there are several clauses which can allow unit resolution to fix u, an 
additional node ac is created for each clause c G C. For any binary clause (-zD V u) G C, the gate 
t±e{ow, ac) is produced. For any other clause (wi V • • • V ItJfc V m), the gate and({oi«i , • • • , owk}, ac) is 
produced. Then the gate or({ac,c G C},ou) is produced, in such a way that val(ou) = 1 if and only if 
unit resolution fixes u to true. 

Because each sub-circuit Qi simulates exactly the effect of unit resolution on the corresponding 
formula i/^i, the value of the output node os^_^_i will reflect the value of the output variable after 
all propagation stages on ijj have been made. 

The number of gates in the circuit is linearly related to the number of clauses in the reified coun- 
terpart of (I), which is 0{n'^k). □ 

ExEimple 7. Let us consider the propagator ((a V 6 V c), {a, 6}, c) . At the first stage of the construction, 
Uq = {ocq ,0Cq}, and U\ = {} because tpo is empty. The input nodes of the circuit are oa,oa,ob, and 
Ob. 

The first layer of the circuit is based on: 

injection clauses 
' " ^ 

V'l = (a V a'l) A (a V aj") A (6 V b'l) A (6 V b'^) 

It consists in the connections tie(oo,oa^), tie(oa,oa^), tie(o6, ofe]*"), tie(o6,o6j~). The variables 
oc^ and oc^ are added to Uq. 
The second layer is based on: 

propagation clauses 

, — — . 

ip2 = {at V at) A (aj" V 02 ) A {b^ V b^) A {b^ V 62 ) A {at V c^) A V C2 ) A 

deduction clauses 

, — — — — — — — — ^ 

{a]; Vbt y ct) A (aj" V q V b^) A {bf V cj" V a^) 

The two last propagation clauses and the two last deduction clauses are ignored because oc^ and 
ocj" are in Uq. The four first propagation clauses are translated into tie{oaf ,oat), tie{oa^ ,002), 
tie(o6^, 06^), tie(o6^,o6^). The first deduction clause is translated into the gate and({a]^, 6^}, cj). 
C2 is added to Uq. 

The third layer is based on: 

propagation clauses 

, — — — — ^ 

tP3 = {at V at) A (02 V ) A V &+) A (6^ V 63 ) A (4 V c+) A {c^ V Cg ) A 

deduction clauses 

, — — — — — — — — V 

(02" V 6+ V c+) A (02" V C2- V 63-) A (6+ V C2- V at) 
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The propagation clause (c^Vcj^) is translated into the connection tie(oc^, ai), the deduction clause 

(flg V 6^ V c^) is translated into the gate and({a2 , ^2'}' '^2)) (md the clause or{{ai,a2},oc'^) is added, 
in such a way that ocjj' is set to 1 either i/oc^ is set to 1 or if both oa^ and 06^ are set to 1, i.e., if at 
stage 2 of unit resolution, either c is fixed to true or a and b are fixed to false and true, respectively... 

A part of the corresponding circuit is given Figure (Recall that this circuit is obtained from a 
reified formula, which, as mentioned above, presents some redundancies.) 




Figure 2: A part of the circuit related to the example [71 



Theorem 9. Let f be any matching function with domain D. f is propagatable if and only if it is 
monotone. 

Proof. Recall that any Boolean function h with domain Dh is said to be monotone if for any z, i € D^, if 
z <B t then h{z) <b h{t), where the ordering relation <b is defined as follows: <b 1, <b 0, 1 <b 1, 
(zi, . . . Zn) <b ih, ■■■tn) if and only if Zi <b <i < n. 

Now, given any matching function / with domain D, because for any I, J € D, I <^ J if and only 
if -^B <B •^B, /b is monotone on if and only if / is monotone on D. 

Let / be any monotone matching function with domain D. Because /b is monotone, it can be 
computed by a monotone circuit. It follows from Theorem [7] that / is propagatable. 

Now let us consider any propagatable matching function / with domain D. It follows from Theorem 
[8] that /b can be computed by a monotone circuit, which implies that /b is monotone. Then / is 
monotone. □ 

Theorem 10. Any family T of propagatable functions is propagatable in polynomial space if and only 
if the family of the Boolean counterparts of T has polynomial space monotone circuit complexity, i.e., 
these functions can be calculated by monotone circuits with a polynomial number of gates. 

Proof. The proof of theorem [7] shows how to create a propagator from a monotone circuit. Each and 
gate with n inputs is translated into one n-ary clause, and each or gate with n inputs is translated into 
n binary clauses. 

The proof of theorem [8] shows how to create a monotone circuit from a propagator P = {(j),V,s). 
This circuit is based on the reified counterpart ip of the formula (p. Each clause of ip with n literals is 
involved in at most n and gates, and each literal is involved in at most one or gate. 

□ 



5 Synthesis and perspectives 

Altogether, the results given in this paper provide important information about the expressive power of 
unit resolution. In particular, we can show that there exist polynomial time complexity propagatable 
functions that admit only propagators with an exponential number of clauses. 

As an example, let us consider the Boolean functions pm^"\ like perfect matching, such that for any 
n-bits Boolean encoding g of a graph G, pm^'"'\g) = 1 if and only if there exists a perfect matching for 
G, that is a set of edges that covers each vertex exactly once. 
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Next, let us consider the variants vprn^") such that 

• the domain D^^^^n) of vpm^") is the set {(xi, . . . , 0, . . . , 0), . . . , x„) G Z)p^(„)}, where 
Dp^(n) is the domain of pm^") , 

• for any b= . . . , x„, 0, . . . , 0) G D^pmi^), vpm(")(5) = 1 if and only if pm(")(xi, ...,Xn) = l- 

Now, let fpm*-"^ denote the matching functions related to vpm^"^. It is known that pm^"), then 
vpm^"'^ have polynomial time computational complexity but exponential monotone circuit complexity 
|15| . It follows from theorem [8] that fpm*-"^ are filtering functions requiring an exponential number of 
clauses to be computed using unit resolution. 

This means that although unit resolution has the same expression power as Boolean circuits regarding 
Boolean functions, it has a lower expression power, namely the expression power of monotone circuits, 
regarding filtering functions. 

This is both very interesting and annoying, because in SAT solvers unit propagation operates on 
filtering functions rather than Boolean functions. Maybe this potential weakness of unit resolution can 
be compensated for by other speed-up technologies. As a research perspective, this has to be verified. 
Meanwhile, in the field of encoding constraints into CNF, it would be very relevant to determine which 
problems can be solved as efficiently using a simple SAT solver, under CNF encoding, as using a dedicated 
constraint solver maintaining generalized arc consistency. This supposes knowing the complexity of the 
related filtering functions regarding unit resolution. 

At least two research directions follow from the ideas presented in this paper. The first one is to 
characterize the expression power of some other speed-up techniques used in modern SAT solvers, like 
clause learning. The second one consists in the research of deduction techniques that can polynomially 
compute any CNF encoded polynomial time complexity filtering functions. 
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